Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement

نویسندگان

چکیده

End-to-end speech-to-text translation (E2E-ST) is becoming increasingly popular due to the potential of its less error propagation, lower latency, and fewer parameters. Given triplet training corpus〈speech, transcription, translation〉, conventional high-quality E2E-ST system leverages the〈speech, transcription〉pair pre-train model then utilizes translation〉pair optimize it further. However, this process only involves two-tuple data at each stage, loose coupling fails fully exploit association between data. In paper, we attempt joint probability transcription based on speech input directly leverage such Based that, propose a novel regularization method for improve agreement dual-path decomposition within data, which should be equal in theory. To achieve goal, introduce two Kullback-Leibler divergence terms into objective reduce mismatch output probabilities dual-path. Then well-trained can naturally transformed as models by pre-defined early stop tag. Experiments MuST-C benchmark demonstrate that our proposed approach significantly outperforms state-of-the-art baselines all 8 language pairs while achieving better performance automatic recognition task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

End-to-End Automatic Speech Translation of Audiobooks

We investigate end-to-end speech-to-text translation on a corpus of audiobooks specifically augmented for this task. Previous works investigated the extreme case where source language transcription is not available during learning nor decoding, but we also study a midway case where source language transcription is available at training time only. In this case, a single model is trained to decod...

متن کامل

End-to-End Evaluation in JANUS: A Speech-to-speech Translation System

JANUS is a multi-lingual speech-to-speech translation system designed to facilitate communication between two parties engaged in a spontaneousconversation in a limited domain. In this paper we describe our methodology for evaluating translation performance. Our current focus is on end-to-end evaluations the evaluation of the translation capabilities of the system as a whole. The main goal of ou...

متن کامل

An Experimental Methodology for an End-to-End Evaluation in Speech-to-Speech Translation

This paper describes the evaluation methodology used to evaluate the TC-STAR speech-to-speech translation (SST) system and their results from the third year of the project. It follows the results presented in (Hamon et al., 2007), dealing with the first end-to-end evaluation of the project. In this paper, we try to experiment with the methodology and the protocol during the second end-to-end ev...

متن کامل

Comparison of nerve repair with end to end, end to side with window and end to side without window methods in lower extremity of rat

  Abstract   Background : Although, different studies on end-to-side nerve repair, results are controversial. The importance of this method in case is unavailability of proximal nerve. In this method, donor nerves also remain intact and without injury. In compare to other classic procedures, end-to-side repair is not much time consuming and needs less dissection. Overall, the previous studies i...

متن کامل

End-to-End Evaluation of a Speech-to-Speech Translation System in TC-STAR

The paper describes an evaluation methodology to evaluate speech-to-speech translation systems and their results. The evaluation scheme uses questionnaires filled in by human judges for addressing the adequacy and fluency of audio translation outputs and was applied in the second TC-STAR evaluation campaign. The same evaluation methodology is carried out both on the outputs of an automatic syst...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2022

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v36i10.21303